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EXPRESSION OF FOREIGN GENES FROM PLANT VIRUS VECTORS 

FIELD OF THE INVENTION 

This present invention is related to the field of viral vectors that are capable of 

expressing an open reading frame in a host. In particular, this invention relates to the 
5 use of an internal ribosome entry site, which is inserted into a heterologous virus to 
obtain gene expression. 

BACKGROUND OF THE INVENTION 

Plant virus-based vectors have a number of advantages as gene expression tools 

1 0 including the ability to direct rapid and high-level expression of foreign genes in 
mature, differentiated, plant tissue and have been used for a number of different 
applications. Reporter proteins expressed by viral genomes allow localization of virus 
infected cells [1-3] and can be used to study mutant phenotypes [4, 5]. Plant virus- 
based vectors are also used for the production of valuable foreign peptides and proteins 

15 in plants [6]. 

Plant virus-based vectors offer advantages over other more costly and less 
flexible protein production systems such as fermentation, and much research has 
focused on the development of DNA and RNA viruses as vectors for gene expression 
in plants. Most approaches for the expression of foreign genes from viral vectors rely 

20 on either expression of the foreign protein as a ftision to a viral protein [2, 7, 8], or 
from a duplicated subgenomic mRNA promoter [9, 10]. However, a disadvantage of 
the latter approach is that the duplicated sequence is prone to homologous 
recombination with the consequent loss of the inserted sequence [11]. Translation of 
most eukaryotic mRNAs occurs by the scanning mechanism in which the 408 

25 ribosome subunit binds to a 5' cap structure and then "scans" the mRNA until it 

reaches an AUG translation initiation codon in a favorable sequence context where 
translation begins [12]. 

An alternative to cap-dependent initiation of translation, involving direct 
recruitment of ribosomes to internal tracts within mRNAs, has been observed for some 

30 cellular and viral mRNAs. Specific sequences, termed "internal ribosome entry sites" 

- 1 - 
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(IRES), located upstream of AUG codons have been found to be involved in this 
process, however, the mechanisms of IRES action are not fully understood [13]. IRESs 
have been found in capped as well as uncapped viral RNA, they show no strong 
sequence homology and direct the translation of mRNAs with different functions, 
5 under different physiological conditions. Although some reports of IRES sequences in 
plant viruses have proven controversial [14], Ivanov et al. [15] demonstrated that the 
148 nucleotide sequence (IREScp) upstream of the coat protein (CP) gene of a 
crucifer-infecting tobamovirus (crTMV) is capable of promoting internal initiation of 
translation of the CP in vitro, acting as an IRES. Skulachev et al. [16] subsequently 

1 0 showed that this 148 nucleotide sequence sequence, and sequences originating from 
the region upstream of the movement protein gene in both crTMV and tobacco mosaic 
virus strain Ul, mediated expression of a 3 '-proximal reporter gene in vi\o, on 
transfection of tobacco protoplasts and particle bombardment of A^. benthamiana 
leaves with bicistronic RNA transcripts. Potato Vims X (PVX) is a single stranded 

1 5 RNA virus [17] that has been used successfully as a vector for gene expression in 

plants using both protein fusion and duphcated promoter expression strategies [1, 18]. 

Here we disclose a novel strategy for the expression of proteins in plants using 
viral vectors containing an IRES sequence. The strategy employed exploits the 
observation that, for PVX, cell-to-cell movement is completely dependent on the 

20 presence of viral coat protein [1, 5, 19]. In order to test the ability of the crTMV 

IREScp to direct the expression of an internal open reading frame from a heterologous 
viral vector we have assessed a series of viral constructs that produce a bicistronic 
mRNA carrying the green fluorescent protein (GFP) open reading frame positioned 
upstream of the IREScp sequence and followed by the PVX CP coding sequence. 

25 Using this strategy the infectivity of the viral constructs could be determined by the 

expression of GFP, and cell-to-cell movement, resulting in multicellular infection foci 
expressing GFP, could determine expression of coat protein. 

The constructs described here are useful to direct expression of a foreign 
nucleic acid sequence in a host in the absence of multiple subgenomic promoters in the 

30 virus expressing the foreign nucleic acid sequence. The foreign nucleic acid sequence 
could code for a pharmaceutical protein useflil in protein replacement therapy, or to 
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intervene in a metabolic pathway to improve the nutritional value of a crop or alter the 
oil content of the seed. 

Citation of the above documents is not intended as an admission that any of the 
foregoing is pertinent prior art. All statements as to the date or representation as to the 
5 contents of these documents is based on the information available to the apphcant and 
does not constitute any admission as to the correctness of the dates or contents of these 
documents. 

SUMMARY OF THE INVENTION 

1 0 The present invention provides for a polynucleotide comprising (1) an IRES 

nucleotide sequence, (2) an ORF encoding a peptide of interest, and (3) an ORF 
encoding a viral protein. An IRES nucleotide sequence, or an IRES sequence, is any 
nucleotide sequence that can direct the translation of an ORF within a roRNA, or 
promote internal translation of an ORF. This IRES nucleotide sequence can be the 

15 nucleotide sequence of any IRES found in nature, or part thereof that behaves as any 
IRES found in nature. The ORF encoding a peptide of interest and the ORF encoding a 
viral protein may be transcribed on a single mRNA or transcript or message or 
transcriptional product. The single mRNA or transcript or message or transcriptional 
product may be transcribed from a promoter. This promoter is functional in an 

20 appropriate cell and is operatively linked to 5' of the ORF encoding a peptide of 
interest and the ORF encoding a viral protein. The IRES nucleotide sequence is 
located between the ORF encoding a peptide of interest and the ORF encoding a viral 
protein. The IRES nucleotide sequence is able to direct the translation of whichever 
ORF is 3' to the IRES nucleotide sequence. In one embodiment the polynucleotide 

25 comprises, in an order from 5' to 3': a promoter, an ORF, an IRES nucleotide 

sequence, and another ORF. The ORFs may be the ORF of a peptide of interest or the 
ORF of a viral protein. 

The peptide of interest can be any peptide that can be expressed in an 
appropriate host. The host can be any cell or whole organism. The cell may be part of a 

30 cell culture or tissue culture or tissue or organ or a whole organism. The cell may also 
be a modified cell, such a protoplast cell. The cell may be an animal or plant cell. The 
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peptide may be non-native or foreign to the IRES nucleotide sequence, the viral 
protein, or vector construct, or the host organism. The peptide may a peptide with 
pharmceutically useful properties, such as for protein replacement therapy, or any 
useful biological activity. The peptide may be useful in intervening in a metabolic 
5 pathway of the host organism in order to improve the nutritional content of the host, to 
improve the growth of the host organism, to improve the disease or pest resistance of 
the host, or to alter any other desired characteristic of the host. 

The IRES can be an IRES fi-om any organism as long as that IRES is able to 
function as an IRES within a host. The IRES can be from an animal (such as a 
10 mammal), plant, or a virus. The IRES nucleotide sequence can be a viral IRES. In a 
preferred embodiment the IRES nucleotide sequence is an IRES of a crucifer-infecting 
tobamo virus. 

The viral protein can be a viral coat protein. In a preferred embodiment the 
viral protein is a viral coat protein of a crucifer-infecting tobamovirus. 
15 In one embodiment of the invention, there is a plurality of ORFs and IRES 

nucleotide sequences, each ORE having an IRES nucleotide sequence upstream of the 
ORE. 

The subject polynucleotide may be part of a recombinant viral construct, a 
recombinant viral vector construct, a recombinant virus, or a host infected by a 
20 recombinant virus. The virus may be a plant virus. In a preferred embodiment the 
virus is a plant RNA virus. In a more preferred embodiment the virus is PVX. 



25 BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 depicts a schematic representation of the genome organization of the PVX 
wild-type clone, TXS, and its derivatives. ORFs are indicated by boxes with the size of 
their predicted products or the names of the proteins they encode (GFP, green 
fluorescent protein. CP, coat protein). An arrow indicates the promoter that directs 
30 expression of the PVX CP and hairpins indicate stem loops. The internal ribosome 
entry site originating from upstream of the coat protein in the crucifer-infecting TMV 
-4- 
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strain is indicated by IRES. Positions of restriction enzyme sites used to produce the 
constructs are indicated. 

Figure 2A depicts confocal images of inoculated A^. benthamiana leaf tissue, 5 days 
5 post infection, showing single cell infections with TXS.GFPACP. Figure 2B depicts 
confocal images of inoculated N. benthamiana leaf tissue, 5 days post infection, 
showing single cell infections with TXS.GFP-IREScpH-CP. Figure 2C depicts 
confocal images of inoculated A^. benthamiana leaf tissue, 5 days post infection, 
showing multicellular infection foci with TXS.GFP. Figure 2D depicts confocal 

10 images of inoculated N. benthamiana leaf tissue, 5 days post infection, showing 

multicellular infection foci with TXS.GFP-IREScp-CP. Figure 2E depicts confocal 
images of inoculated A/^ benthamiana leaf tissue, 5 days post infection, showing 
multicellular infection foci with TXS.GFP-HIREScp-CP. Figure 2F depicts confocal 
images of inoculated A'! benthamiana leaf tissue, 5 days post infection, showing 

1 5 multicellular infection foci with TXS .GFP-SERIcp-CP. 

Figure 3 depicts the nucleotide sequences, between the Sail and Sad restriction sites, 
which comprises the IRES sequence, of clones TX.GFP-ERESs-CP (SEQ ID NO: 1), 
TXS-HRES-CP (SEQ ID NO: 2), TXS.GFP-SERI-CP (SEQ ID NO: 3), and 
20 TXS.GFP-IRESmp-CP (SEQ ID NO: 4). 

Figure 4 depicts the aUgnment of the nucleotide sequences, between the Sail and Sad 
restriction sites, comprising the IREScp sequence, in the progenitor clone TXS.GFP- 
ERESs-CP (SEQ ID NO: 5) and its derivatives SCI 96 (SEQ ID NO: 6) and SCI 97 
25 (SEQ ID NO: 7). Asterisks ("*") indicate nucleotides that have been substituted. "D" 
indicates nucleotides that have been deleted. 

DESCRIPTION OF THE SPECIFIC EMBODIMENTS 

Definitions and Abbreviations 



-5- 
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Bicistronic means a cistron which contains at least two ORFs in a mRNA 
transcribed from a single promoter whereby at least two peptides are translated from 
the mRNA. 

Plant virus-based vectors means an engineered plant virus which is capable of 
5 expressing a desired protein or trait in a host. 

Expression means transcription, translation, protein synthesis, or any 
combination of transcription, franslation, and protein synthesis. 

Foreign gene means any gene that is not derived from or extracted from or 
native to a vector into which it is inserted. 
1 0 Reporter protein means a protein which when expressed by viral genomes 

allow localization of virus-infected cells. 

Host means a cell, tissue, organ, or organism capable expressing the ORFs of 
the subject polynucleotides. This term is intended to include prokaryotic and 
eukaryotic cells, organs, tissues or organisms, where appropriate. Bacteria, ftmgi, 
1 5 yeast, animal (cell, tissue, organ, or organism), and plant (cell, tissue, organ, or 
organism) are examples of a host. 

Infection mean the abihty of a virus to fransfer its nucleic acid to a host or 
introduce a viral nucleic acid into a host, wherein the viral nucleic acid is replicated, 
viral proteins are synthesized, and new viral particles assembled. The term is also 
20 meant to include the ability of a selected nucleic acid sequence to integrate into a 
genome, chromosome or gene of a host or target organism. 

Internal ribosome entry sites (IRES) means a nucleic acid sequence which 
involving direct recruitment of ribosomes to internal tracts within mRNAs. IRESs are 
an alternative to cap-dependent initiation of franslation. Specific sequences, termed 
25 IRES direct the translation of mRNAs with different functions, under different 
physiological conditions. 

IRESmp means nucleotide sequence (IRESmp) upstream of the movement 
protein (MP) gene of Tobamo virus, IRESmp is capable of promoting internal initiation 
of franslation of the MP RNA in vitro, acting as an IRES. 
30 IREScp, the 148 nucleotide sequence (IREScp) upsfream of the coat protein 

(CP) gene of a crucifer-infecting tobamovims (crTMV) is capable of promoting 
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internal initiation of translation of the CP RNA in vitro, acting as an IRES. This 
sequence, and sequences originating from the region upstream of the movement 
protein gene in both crTMV and tobacco mosaic vims strain Ul, mediated expression 
of a 3 '-proximal reporter gene in vivo, on transfection of tobacco protoplasts and 
5 particle bombardment of A^. benthamiana leaves with dicistronic RNA transcripts. 

Heterologous viral vector means an engineered virus, which is not the origin of 
the IRES into which it is inserted. Similarly an IRES sequence which is heterologous 
to the viral vector is one which does not derive from the virus. 

ORF or open reading frame means a nucleotide sequence encoding a series of 
10 sense codons that lack a terminating codon. The ORF may be encoded in any nucleic 
acid, including DNA or RNA, and the nucleic acid may be any form, including single- 
stranded or double-stranded. An ORF may encoded a peptide that is expressed and 
may be a gene. 

15 The Invention 

The present invention provides for a polynucleotide comprising (1) an IRES 
nucleotide sequence, (2) an ORF encoding a peptide of interest, and (3) an ORF 
encoding a viral protein. An IRES sequence, is defined as any nucleotide sequence that 
can direct the translation of an ORF within a mRNA, or promote internal translation of 

20 an ORF. One method by which an IRES nucleotide sequence may direct the translation 
of an ORF within a mRNA, or promote internal translation of an ORF, is by direct 
recruitment of ribosomes to internal tracts within mRNAs. An IRES sequence can be 
the nucleotide sequence of any IRES found in nature, or part thereof that behaves as 
any IRES fovmd in nature. An IRES sequence can also be any nucleotide sequence that 

25 is synthetic or artificially designed that can direct the translation of an ORF within a 
mRNA. One of ordinary skill in the art can by performing the present experiments 
disclosed, with alterations of the IRES sequence, determine whether any nucleotide 
sequence can direct the franslation of an ORF within a mRNA in an appropriate host. 
An IRES sequence can comprise the nucleotide sequence of SEQ ID NO: 1, SEQ ID 

30 NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, or SEQ ID NO: 6, or any 
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fragment thereof that can direct the translation of an ORF within a mRNA in an 
appropriate host. 

The ORF encoding a peptide of interest and the ORF encoding a viral protein 
may be transcribed on a single mRNA or transcript or message or transcriptional 
5 product. The single mRNA or transcript or message or transcriptional product may be 
transcribed from a promoter. This promoter is functional in an appropriate cell and is 
operatively linked to 5' of the ORF encoding a peptide of interest and the ORF 
encoding a viral protein. The IRES nucleotide sequence is located between the ORF 
encoding a peptide of interest and the ORF encoding a viral protein. The IRES 

10 nucleotide sequence is able to direct the translation of whichever ORF is 3' to the IRES 
nucleotide sequence. In one embodiment the polynucleotide comprises, in an order 
fi-om 5' to 3': a promoter, an ORF, an IRES nucleotide sequence, and another ORF. 
The ORFs may be the ORF of a peptide of interest or the ORF of a viral protein. 
The peptide of interest can be any peptide that can be expressed in an 

1 5 appropriate host. The host can be any cell or whole organism. The cell may be part of a 
cell culture or tissue culture or tissue or organ or a whole organism. The cell may also 
be a modified cell, such a protoplast cell. The cell may be an animal or plant cell. The 
peptide may be non-native or foreign to the IRES nucleotide sequence, the viral 
protein, or vector construct, or the host organism. The peptide may a peptide with 

20 pharmceutically useful properties, such as for protein replacement therapy, or any 
useful biological activity. The peptide may be useful in intervening in a metabolic 
pathway of the host organism in order to improve the nutritional content of the host, to 
improve the growth of the host organism, to improve the disease or pest resistance of 
the host, or to alter any other desired characteristic of the host. 

25 The IRES can be an IRES from any organism as long as that IRES is able to 

function as an IRES within a host. The ERES can be from an animal (such as a 
mammal), plant, or a virus. The IRES nucleotide sequence can be a viral IRES. In a 
preferred embodiment the IRES nucleotide sequence is an IRES of a plant virus. In a 
more preferred embodiment the IRES nucleotide sequence is an IRES of a plant RNA 

30 vims. In an even more preferred embodiment the IRES nucleotide sequence is an IRES 
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of a tobamovirus. In an even further more preferred embodiment the IRES nucleotide 
sequence is an IRES of a crucifer-infecting tobamovirus. 

The viral protein can be a viral coat protein. In a more preferred embodiment 
the viral protein is a viral coat protein of a plant virus. In an even more preferred 
5 embodiment the viral protein is a plant viral coat protein. In an even further more 
preferred embodiment the viral protein is coat protein of a crucifer-infecting 
tobamovirus. 

The subject polynucleotide can comprise of the following components in the 
following order: 
10 P-N-R-V 

or 

P-V-R-N 

wherein "P" is the promoter, "N" is the ORF of the peptide of interest, "V" is the ORF 
of the viral protein, and "R" is the IRES nucleotide sequence. In addition the subject 
1 5 polynucleotide may have additional nucleotide sequences between each component 
and/or flanking the components, as long as these nucleotide sequences do not interfere 
with the transcription or translation or expression of the ORF of the peptide of interest 
and the ORF of the viral protein. 

In another embodiment of the invention, there is a plurality of ORFs and IRES 
20 nucleotide sequences, in which the subject polynucleotide can comprise of the 
following components in the following order: 

P-N-R-V-Ri-Ni-Rz-Nj -. . .-R^-N„ 
or 

P-V-R-N-R1-N1-R2-N2 -. . .-R^-N„ 

25 wherein "P", "N", "R" and "V" are as defined above and "R„" is any IRES nucleotide 
sequence (as exemplified by "R," and "Rj" above) and "N„" is any ORF of any desired 
viral or non-viral peptide (as exemplified by "N/' and "Nj" above), where "n" is an 
integer equal to one or greater. In this embodiment, there is a plurality of ORFs and 
IRES nucleotide sequences: each ORF having an IRES nucleotide sequence upstream 

30 of the ORF. In addition the subject polynucleotide may have additional nucleotide 

sequences between each component and/or flanking the components, as long as these 
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nucleotide sequences do not interfere with the transcription or translation or expression 
ofany ORF. 

The subject polynucleotide can be part of a plasmid, vector, vector construct, 
viral vector construct, recombinant viral construct, or part of a viral genome. The 
5 subject pol5Tiucleotide or aforementioned plasmid, replicon, episome, vector, vector 
construct, viral vector construct, recombinant viral construct, or viral genome can be 
part of a virus, recombinant vims, viral particle, virion or the hke. The virus, 
recombinant virus, viral particle, or virion can infect an appropriate host. 

The virus may be a plant virus. In a preferred embodiment the virus is a plant 
1 0 RNA virus. In a more preferred embodiment the virus is a plant single-stranded RNA 
virus. In an even more preferred embodiment the virus is Potato Virus X ("PVX"). 
PVX is a single-stranded RNA virus that has been used successfully as a vector for 
gene expression in plants using both protein fusion and duplicated promoter expression 
strategies. 

15 In one embodiment of the invention, an IRES is capable of directing the 

expression of an internal ORF in a heterologous viral vector. 

In another embodiment of the invention, a viral vector construct is described 

which produces a bicistronic mRNA carrying an ORF positioned upstream of an IRES 

sequence and followed by a coat protein coding sequence. The ORF can be any gene 
20 to be expressed in a host, or it may be a reporter gene ORF by which the progress of 

coat protein expression and viral movement can be monitored. 

In another embodiment, a viral vector construct, comprises: (1) the genome of a 

virus, and (2) an IRES sequence which is heterologous to the virus, the IRES sequence 

being inserted into the virus downstream of any desired gene or ORF and upstream of 
25 a virus coat protein gene, wherein the IRES sequence being inserted in the sense or 

antisense orientation. This viral construct might be one that produces a bicistronic 

mRNA. 

In another embodiment, a viral vector construct, comprising: (1) the genome of 
a virus, (2) an IRES sequence which is heterologous to the virus, the IRES sequence 
30 being inserted into the virus downstream ofany desired gene or ORF and upstream of 
a virus coat protein gene, wherein the IRES sequence being inserted in the sense or 
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antisense orientation; and, (3) a stable stem loop structure inserted 5' of an IRES 
sequence. Such a viral vector construct might give rise to one or more single cell 
infection sites or to systemic-host infection. A viral vector construct comprising a stem 
loop structure functions as a site for direct recruitment of ribosomes for initiation of 
5 translation. The stem loop structure can be immediately upstream of the IRES 

sequence. A viral vector construct having a stem loop immediately upstream of the 
IRES sequence leads to a reduction in coat protein expression levels. It may be useful 
to control of rate of system infection. 

In another embodiment a viral vector construct comprises: (1) the genome of a 

10 virus; (2) an IRES sequence which is heterologous to the virus, the IRES sequence 

being inserted into the virus downstream of any desired gene or ORE and upstream of 
a virus coat protein gene, wherein the IRES sequence is inserted in the sense or 
antisense orientation; and, (3) a stable stem loop structure inserted 3' of an IRES 
sequence. This construct is capable of preventing expression of the downstream CP 

15 ORE in the bicistronic mRNA. It is also able to effectively block scanning ribosomes. 

The invention is exemplified by viral constructs comprising the nucleotide 
sequences of TXS, TXS.GPF, TXS.GPF-ACP, TXS.GPF-IRES-CP, TXS.GPF-IRESs- 
CP,TXS.GPF-HIRES-CP, TXS.GPF-IRESH-CP, TXS.GPF-SERI-CP, and TXS.GFP- 
IRESs(mp)-CP. 

20 The provides for a virus, recombinant virus, viral particle, virion or the like 

comprising any of the aforementioned viral vector constructs or viral constructs. 

The following examples further illustrate the present invention. These 
examples are intended merely to be illustrative of the present invention and are not to 
be construed as being limiting. 



EXAMPLES 

Potato virus X (PVX)-based vector constructs were generated to investigate the 
use of an internal ribosome entry site (IRES) sequence to direct translation of a viral 
gene. An IRES sequence from a crucifer-infecting strain of tobacco mosaic virus was 
30 used to direct expression of the PVX CP. The IRES was inserted downstream of the 

- 11 - 
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gene encoding GFP and upstream of the PVX CP, in either sense or antisense 
orientation, such that CP expression depended on ribosome recruitment to the IRES. 
Stem loop structures were inserted at either the 3'- or 5 '-end of the IRES sequence to 
investigate its mode of action. In vitro RNA transcripts were inoculated to Nicotiana 
5 benthamiana plants and protoplasts, levels of GFP and CP expression were analysed 
by ELISA and the rate of virus cell-to-cell movement was determined by confocal 
laser scanning microscope imaging of GFP expression. PVX CP was expressed, 
allowing cell-to-cell movement of virus, from constructs containing the IRES sequence 
in either sense or antisense orientation, and from the construct containing a stem loop 
10 structure at the 5 '-end of the IRES sequence. No CP was expressed from a construct 
containing a stem loop at the 3 '-end of the IRES sequence. Our results suggest that the 
IRES sequence is acting in vivo to direct expression of the 3 '-proximal ORF in a 
bicistronic mRNA thereby demonsfrating the potential of employing IRES sequences 
for the expression of foreign proteins from plant virus-based vectors. 

15 

Biological Deposits 

The following plasmid vectors were deposited with ATCC, 10801 University 
Blvd., Manassas, Virginia 201 10, USA on November 22, 2000: pHIRES-XCP 
(accession no. PTA-2717), pSERI-XCP (accession no. PTA-2718), pIRES-XCP 
20 (accession no. PTA-2719), pIRESx-XCP (accession no. PTA-2720), pTXS.GFP- 
IRESH-CP (accession no. PTA-2721), pTXS.GFP-SERI-CP (accession no. PTA- 
2722), pTXS.GFP-HIRES-CP (accession no. PTA-2723), pTXS.GFP-IRES-CP 
(accession no. PTA-2724), pTXS.GFP-IRESs-CP (accession no. PTA-2725), and 
pTXS.GFP-IRESs(mp)-CP (accession no. PTA-2726). 

25 

EXAMPLE 1. 
Plasmid constructions. 

The plasmid pTXS, carrying a cDNA of the wild-type PVX genome, 
pTXS.GFP, in which GFP expression is under the franscriptional confrol of a 
30 duplicated subgenomic mRNA promoter, and pTXS.GFP-ACP, in which the CP 
sequence is deleted, have been described previously [5]. The plasmid pIRES-GUS 
- 12- 
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carries the IREScp sequence positioned upstream of the P-glucuronidase (GUS) open 
reading frame [15]. 

Construction of pIRES-XCP. 
5 PGR amplification was performed using pTXS as template with an upstream 

mutagenic primer designed to introduce an Ncol restriction site across the initiating 
AUG codon of the viral CP RNA and the universal reverse primer as the downstream 
oligonucleotide. Following digestion of the ampUfication product with Ncol and Sad, 
the resulting fragment, which spans the CP RNA sequence and 3 '-untranslated region 
10 of PVX, was inserted into pIRES-GUS from which the GUS coding sequence had 
been removed by digestion with Ncol and Sad. The resulting plasmid, pIRES-XCP 
was the basis for all subsequent plasmid constructions. 

Construction of pIRESs-XCP. 

1 5 PCR amplification of pIRES-XCP was performed with a non-mutagenic 

upstream primer, that introduced an E'coRI site, and a mutagenic downstream primer 
that introduced a Sad site between the 3 ' end of the IREScp sequence and the Ncoi 
site. The amplification product was digested with EcoRi and Ncol prior to cloning of 
the fragment between the same sites, flanking the IREScp sequence, of pIRES-XCP to 

20 produce pIRESs-XCP. 

Construction of pSERI-XCP with anti-sense IRES . 

PCR amplification of pIRES-XCP was performed with mutagenic primers that 
25 introduced Ncol and EcoBl sites at the 5' and 3' ends of the IREScp sequence, 
respectively.. The amplification product was digested with Ncol and -EcoRI prior to 
cloning of the amplified fragment in reverse orientation between the same sites of 
pIRES-XCP to produce pSERI-XCP. 

30 Construction of pHIRES-XCP with Stem Loop. 



- 13 - 
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Sequence encoding a stem loop was introduced 5' of the IREScp sequence by 
digestion of pIRES-XCP with EcoRI and Hgation to a self-annealed oligonucleotide 
(5'-AAT-TCG-GAT-CCC-GGG-GGG-CCC-TAC-CGC-CGC-GGC-GGT-TAA-CCG- 
CCG-CGG-CGG-TAG-GGC-CCC-CCG-GGA-TCC-G-3') (SEQ ID NO: 8) producing 
5 pHIRES-XCP. 

Construction of full-length clones. 

Full-length clones were produced by digestion of the subclones pIRES-XCP, 
1 0 pIRESs-XCP, pSERI-XCP and pHIRES-XCP with Sail and Spel, and cloning of the 
released firagments, encompassing the IREScp sequence, CP and 3 'imtranslated region 
of PVX, between the same sites of pTXS.GFP to produce 

pTXS.GFP-IRES-CP, pTXS.GFP-IRESs-CP, pTXS.GFP-SERI-CP and pTXS.GFP- 
HIRES-CP respectively. 

15 

Construction of pTXS.GFP-IRESH-CP. 

The plasmid pHIRES-XCP was digested with EcoRI to release the fragment 
encoding the stem loop and, after T4 DNA polymerase treatment to fill in the 
20 overhangs, this fragment was cloned into pTXS.GFP -IRESs-CP that had been digested 
with Sad and T4 DNA polymerase treated. The resulting plasmid, pTXS.GFP-IRESH- 
CP, thus contained the sequence encoding the stem loop 3 ' of the IREScp sequence. 

Construction of pIRESsCmpVXCP. 

25 

A fragment encompassing nucleotides 4800 to 4874 was amplified from a full- 
length turnip vein clearing virus cDNA clone (Lartey, R.T., Voss, T.C., Melcher, U. 
Gene 1995. 166(2):331-2; Lartey, R.T., Lane, L.C., Melcher, U. Arch. Virol. 1994. 
138(3-4):287-98) using mutagenic primers that introduced EcoRI, and Sad and Ncol 
30 sites at the 5' and 3' ends respectively. The amplification product was digested with 
EcoRI and Ncol prior to cloning between the same sites of pIRES-XCP to produce 
pIRESs(mp)-XCP. 
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Construction of full length clone pTXS.GFP-IRESs(mpVCP. 

A full-length clone was produced (as with the other subclones) by digestion 
with Sail and Spel, and cloning of the released fragment between the same sites of 
pTXS.GFP to produce pTXS.GFP-IRESs(mp)-CP. 

5 

In vitro transcription and plant inoculation. 

All plasmids were linearized with Spe\ prior to in vitro transcription using a T7 
mMESSAGE mMACHINE™ kit (Ambion, Austin, TX) according to the 
manufacturer's instructions. Transcription reaction products were inoculated directly to 
10 aluminum oxide dusted leaves ofN. benthamiana as described previously [5]. Two 
leaves were inoculated per plant, and each leaf was inoculated with the transcript 
products derived from 0.2 |xg plasmid template. 



Detection of fluorescence and measurement of infection foci. 

1 5 Leaves were viewed under UV illumination (365 nm) generated from a Blak 

Ray BIOO-AP lamp (Ultraviolet Products, San Gabriel, CA). For confocal imaging 0.5 
cm^ squares of leaf tissue were cut, mounted in water and imaged using an MRC 1000 
confocal laser-scanning microscope (Bio-Rad, Hercules, CA) as described previously 
[5]. The size of fluorescent infection foci was measured using COMOS software (Bio- 

20 Rad). An analysis of variance was carried out on area measurements of 20 separate 
infection foci for each construct, 5 days post inoculation (dpi). The least significant 
difference at a 5 % level was calculated and used to identify groups of data showing 
statistically significant differences. 



25 Protoplast preparation and transfection. 

N. benthamiana plants (4-5 weeks old), grown for at least 10 days in a 
controlled envirormient room (16 h light, 22°C), were used for preparation of 
mesophyll protoplasts according to the method of Power and Chapman [20]. 
Approximately 6x10^ protoplasts were electroporated with 10 |j.L of an in vitro 

30 transcription reaction as previously described [21]. Electroporated protoplast samples 
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were incubated at approximately 20°C with 16h light (200 lux) and harvested after 48 

h for protein quantification by ELISA. 

ELISA. 

Accumulation of GFP and CP in transfected protoplasts was determined by 
5 indirect triple antibody sandwich ELISA essentially as described by Clarke and Bar- 
Joseph [22]. ELISA plate wells (MaxiSorp, Nalgene Nunc International, Rochester, 
NY) were coated with a monoclonal antibody raised against either GFP (antibody 3E6, 
Molecular Probes, Eugene, OR) or PVX CP (antibody MACS 8). Protoplasts from 
triplicate wells of a tissue culture plate were pooled, pelleted and the pellets ground in 

10 400 fi,l PBS. Supematants collected after brief centrifiigation were used for ELISA. 
Bound CP and GFP were probed for with polyclonal antiserum raised against either 
PVX (Adgen, Auchincruive, U.K.) or GFP [18] and subsequently alkahne phosphatase 
conjugated antibody raised against rabbit IgG (Sigma, St Louis, MO). All antibodies 
were diluted 1:2000 for use. The ELISA reaction product (p-nitrophenyl) was 

15 quantified colorimetrically. Levels of CP and GFP were calculated from standards 
using Biolinx"^^ software (Dynatech Laboratories, Chantilly, VA). 



Results. 

Five different IRES-containing constructs were produced (Figure 1). In 
20 pTXS.GFP-IRES-CP, the IRES was introduced in the sense orientation between the 
GFP and CP coding sequences in order to allow the synthesis of a bicistronic 
subgenomic mRNA with coat protein expression dependent on recruitment of 
ribosomes to the IREScp sequence. A derivative of this clone, pTXS.GFP-IRESs-CP, 
with a unique Sad restriction enzyme recognition site between the IRES sequence and 
25 the PVX CP gene, was prepared to aid subsequent plasmid constructions. 

In the clone pTXS.GFP-HIRES-CP, sequence encoding a stem loop structure 
(AG -90 kcal/mol), was inserted between the 3 '-end of the GFP gene and the 5 '-end of 
the IRES sequence, in order to block leaky scanning of ribosomes through the gfp 
gene. In pTXS.GFP-IRESH-CP the stem loop structure described above was 
30 positioned between the 3' end of the IRES sequence and the CP coding sequence, in 
order to block scanning ribosomes and prevent translation of the CP. Thus, if the 
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crTMV-derived sequence was acting as an IRES, the introduction of the stem loop at 
the 3 '-end but not the 5 '-end of the IRES would be expected to block CP expression. In 
the plasmid pTXS.GFP-SERI-CP, the IRES was placed in the opposite orientation in 
the expectation that IRES activity would be blocked and CP expression would not 
5 occur. 

In vitro run-off transcripts, synthesized from the above plasmids and the 
control plasmids pTXS.GFP and pTXS.GFPACP, were infectious as determined by the 
expression of GFP, giving rise either to individual GFP expressing cells (TXS.GFP- 
IRESH-CP, pTXS.GFPACP) or multicellular infection sites (pTXS.GFP-IRES-CP, 
10 pTXS.GFP-IRESs-CP, pTXS.GFP-HIRES-CP, pTXS.GFP-SERI-CP and pTXS.GFP, 
Figure 2). 

Area measurements of multicellular, fluorescent, infection foci obtained with 
the different CP-expressing constructs, representative examples of which are shown in 
Figure 2, indicated that the size of infection foci varied between the different 

15 constructs. The mean area of 20 infection foci, measured for each construct showing 
cell-to-cell movement at 5 dpi are shown in Table 1. Statistical analysis of the data 
showed that all the IRES -containing constructs displayed significantly slower cell-to- 
cell movement than the control TXS.GFP, in which the CP is translated directly from 
the 5 '-end of a subgenomic mRNA. Furthermore, the infection foci produced by 

20 TXS.GFP-HIRES-CP were significantly smaller than those produced by TXS.GFP- 
IRES-CP, TXS.GFP-IRESs-CP and TXS.GFP-SERI-CP (Table 1). 
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TABLE 1. Measurement of fluorescent infection foci.* 



inoculimi 


mean area of fluorescent 
infection foci (mm^) 


TXS.GFP 


4.90 


TXS.GFP-IRES-CP 


0.516 


TXS.GFP-IRESs-CP 


0.514 


TXS.GFP-SERI-CP 


0.461 


TXS.GFP-HIRES-CP 


0.257 


TXS.GFP-IRESH-CP 


0.0 


TXS.GFP-IRESs(mp)-CP 


0.271 



* The areas of fluorescent lesions on inoculated leaves of A^. benthamiana were 
measured 5 dpi. Mean areas in mm^ are presented. Analysis of variance gave a least 
5 significant difference at the 5% level of 0. 107. 

The constructs were further analyzed in transcript-inoculated N. benthamiana 
protoplasts with TXS, TXS.GFP, and TXS.GFPACP as controls. The levels of GFP 
and CP accumulation in infected protoplasts were measured by ELISA at 2 dpi (Table 

1 0 2). The results from this quantitative study of CP and GFP accumulation correlated 
well with the in planta observations. All of the constructs carrying the gfp gene, with 
the exception of TXS.GFPACP, gave rise to similar levels of GFP accumulation, while 
levels of CP expression were more variable. There was no detectable CP expression in 
infections with TXS.GFP-IRESH-CP, as expected from the in planta experiment 

15 where no cell to cell movement was observed. The other IRES-containing constructs 
accumulated lower levels of CP than the TXS and TXS.GFP controls. The construct 
TXS.GFP-SERI-CP accumulated similar levels of CP to constructs TXS.GFP-IRES- 
CP and TXS.GFP-IRESs-CP. In addition, TXS.GFP-HIRES-CP, which showed 
sUghtly reduced cell to cell movement on plants relative to the other IRES containing 

20 constructs, showed the lowest level of CP accumulation in protoplasts out of the 
constructs that produced any CP. 
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TABLE 2. Green fluorescent protein and coat protein accumulation in N. benthamiana 
protoplasts.** 



Inoculum 


ng GFP / i^g TSP 


ng CP / |ig TSP 


TXS 


0.0 


0.492 


TXS.GFP 


0.0369 


0.442 


TXS.GFP-ACP 


0.0603 


0.0 


TXS.GFP-IRES-CP 


0.0360 


0.1518 


TXS.GFP-IRESs-CP 


0.0342 


0.1080 


TXS.GFP-SERI-CP 


0.0415 


0.1081 


TXS.GFP-HIRES-CP 


0.0347 


0.0504 


TXS.GFP-IRESH-CP 


0.0428 


0.0 


MOCK 


0.0 


0.0 



** Protoplasts were inoculated with transcripts and levels of GFP and CP 
5 accumulation were assayed by ELISA, 2 dpi. TSP = total soluble protein. 

EXAMPLE 2. 

Method for improving/screening IRES activity. 

To mutate the IREScp sequence in pTXS.GFP-IRESs-CP the plasmid was used 

1 0 in mutagenic PCR basically as described by Leung et al [25] with a 5 ' primer 

equivalent to the 3' end of the GFP gene (GES) and a 3' primer complementary to the 
5' end of the PVX coat protein gene (N3#4). The products of a first round of 
mutagenic PCR were used as template in a second roimd of mutagenic PCR. The 
amphfication products from the second round PCR were digested with Sail and Sad 

15 and ligated into the progenitor plasmid, digested with the same enzymes, in place of 
the unmutated IREScp sequence. After transformation into E. coli, DNA was prepared 
from a population of circa 100,000 independent clones and used as template in a 
fiirther two rounds of mutagenic PCR. The final amphfication products were digested 
with Sail and Sad and hgated into Hke cut pTXS.GFP-IRESs-CP. After 

20 fransformation into E. coli, DNA was prepared from a population of circa 200,000 
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independent clones. Further, DNA was prepared from six individual clones for 
nucleotide sequence determination of the mutated sequence, which indicated a base 
mutation rate of approximately 7%. 

To select clones with modified IRES activity the DNA population was 
5 linearized with Spel and transcripts inoculated to N. benthamiana. Plants were 

inspected at 7dpi under UV illumination and two variant lesions displaying increased 
fluorescence / larger fluorescent lesion area selected. The lesions were excised and 
RNA extracted [1]. The RNA was reverse transcribed according to the manufacturers 
instructions with Superscript II (GIBCO BRL, Paisley, U.K.) and a primer 

1 0 complementary to the 5 ' end of the P VX CP gene (N3#4). The first strand cDNA 

products were purified and ampUfied through PGR using a primer equivalent to the 3' 
end of the GFP gene and a primer complementary to the 5' end of the PVX CP gene. 
The amplification products, encompassing the mutated IREScp sequence, were 
digested with Sail and Sad prior to cloning between the same sites of pTXS.GFP- 

15 IRESs-CP producing SC196 and SCI 97. 

To test whether clones with enhanced rates of cell to cell movement had been 
obtained, resulting from increased levels of IREScp activity, DNA was linearized and 
transcribed. The transcripts were inoculated to one half of A^. benthamiana leaves. 
Transcripts from the unmodified progenitor clone were inoculated to the other halves 

20 of the leaves as a control. Lesion area measurements were performed as described in 
methods. The mean lesion areas were calculated for the sample and control 
inoculations on paired half leaves (Table 3). Two-sample T-tests showed that both of 
the selected clones produced significantly larger lesions than the unmodified control (p 
> 0.1). The nucleotide sequence of the mutated IREScp sequence found in the clones 

25 SC 1 96 and SC 1 97 was determined (Figure 4). Alignment of the determined sequences 
with the unmodified sequence of the progenitor clone, pTXS.GFP-IRESs-CP, between 
the Sail and Sad sites used for cloning of the mutated fragment showed base mutation 
rates of 6.7 and 5.9% respectively (Figure 3). 

30 
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TABLE 3. Measurement of fluorescent infection foci.*** 



Inoculum 


Mean area of fluorescent 
infection foci (mm^) 


SC196 


1.10 


TXS.GFP-IRESs-CP 


0.492 






SCI 97 


1.74 


TXS.GFP-IRESs-CP 


0.547 



*** The areas of fluorescent lesions on inoculated half leaves of N. benthamiana were 
measured at 8 dpi. Mean areas in mm^are presented for paired half leaves inoculated 
5 with samples and control. 

Most approaches to foreign gene expression using virus-based vectors have 
relied either on the S3nithesis of a polyprotein, which is proteolytically processed to 
release the foreign protein, or depend on a viral promoter to direct expression of a 

10 foreign gene at the 5 '-end of a subgenomic mRNA. Here we have investigated the use 
of internal initiation of translation as an alternative approach to the expression of genes 
from virus vectors. The use of an IRES sequence to direct gene expression avoids the 
need to duplicate promoter sequences in gene insertion, circumventing the possibility 
of homologous recombination. We used PVX-based vector constructs that produce a 

15 bicistronic mRNA containing the IREScp sequence previously described by Ivanov et 
al. [15] in which expression of the 3 '-proximal CP gene was dependent on internal 
ribosome entry (Figure. 2, Tables 1 and 2). All constructs carrying the gfp gene 
accumulated similar levels of GFP with the exception of TXS.GFPACP, which 
accimiulated two- to three-fold more GFP than the other vectors. This observation is 

20 most likely a consequence of the higher transcriptional activity of the single, 3'- 

proximal, subgenomic promoter present in this vector. 

A stable stem loop structure inserted at the 5' end of the IREScp sequence did 

not abohsh expression of the CP indicating that leaky scanning of ribosomes through 

the gfp gene was unlikely. In addition a construct in which the IREScp sequence was 
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completely deleted gave rise to single cell infection sites further suggesting that leaky 
scanning was not occurring. In contrast to the situation with the stem loop positioned 
5' to the IREScp the stem loop structure inserted 3' to the IREScp sequence prevented 
expression of the downstream CP open reading frame in the bicistronic mRNA. Thus 
5 the stem loop was able to effectively block scanning ribosomes. These results indicate 
that the IREScp sequence is functioning as a site for direct recruitment of ribosomes 
for initiation of translation. Our results support observations by Skulachev et al [16] 
that 3 '-proximal gene expression was obtained from bicistronic transcripts, separated 
by the IREScp sequence, even when translation of the first gene was abohshed by a 

1 0 stem loop structure inserted upstream of the 5 '-proximal open reading frame. However, 
in our experiments the presence of the stem loop immediately upstream of the IRES 
sequence led to a reduction in CP expression levels (Figure 2, Tables 1 and 2). This 
observation could be explained if the upstream loop sequence interfered with direct 
ribosome landing at the IRES, a conclusion consistent with the known importance of 

1 5 tertiary structure in some animal viral and cellular IRES sequences [11], resulting in 
reduced levels of CP expression. 

Alternatively, reinitiation of translation, a phenomenon previously reported to 
occur with low efficiency in plants [23], could explain translation of the downstream 
ORE in constructs carrying the putative IRES sequence. In this scenario actively 

20 translating ribosomes might be capable of inefficiently melting the loop structure prior 
to reaching the GFP stop codon. That reinitiation of translation is resulting in 
expression of the CP is argued against by the fact that the levels of accumulation of 
CP, encoded by the 3' cistron of the bicistronic mRNA, are higher than those of GFP, 
though, this does not take into account possible differences in protein stability between 

25 the GFP and CP. Also, it has been shown that IRES sequences themselves can be 
strong inhibitors of translation by the scanning mechanism, due to their highly 
structured nature [24]. Surprisingly, the IREScp sequence appears to initiate 
translation effectively in either orientation indicating that either the IRES activity is 
not orientation specific or that there may be a functional structure that is conserved in 

30 both strands. An ahemative explanation is that there is a cryptic promoter element 
within the IREScp sequence that is able to initiate transcription of an additional 
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subgenomic RNA, however, we obtained no evidence of extra subgenomic RNAs in 
northern blotting experiments and there is no reason to expect a tobamovirus-derived 
sequence to function as a promoter when present in a heterologous virus. 

This ability of the IREScp sequence to function in either orientation raises the 
5 possibihty that this sequence does not function in the same fashion as previously 

described IRES sequences fi-om animal viruses [1 1]. Further experiments are required 
to define precisely the mechanism(s) by which the IREScp sequence acts. However, it 
is possible that both internal initiation and re-initiation of translation are operating 
concurrently, and when one system is abolished the other is able to continue to drive 
1 0 translation of the downstream gene. 

The data presented demonstrates using a plant virus-derived IRES to direct 
translation of a protein from a heterologous viral vector. 

Although the invention has been described with reference to the presently 
1 5 preferred embodiments, it should be understood that various modifications can be 
made without departing firom the spirit of the invention. 

All publications, patents, patent applications, and web sites are herein 
incorporated by reference in their entirety to the same extent as if each individual 
patent, patent application, or web site was specifically and individually indicated to be 
20 incorporated by reference in its entirety. 
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